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DETAILED ACTION 

Continued Examination Under 37 CFR 1.114 

I 

1. A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1.17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1.17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 1/23/06 
has been entered. 

Response to Amendment 

2. Claims 1-57 have been canceled. 

3. Claims 58-73 have been newly added. 

.1" 

Claim Rejections - 35 USC § 101 

4. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

5. Claims 72 and 73 are rejected under 35 U.S.C. 101 because the claimed 
invention is directed to non-statutory subject matter. 

Claims 72 and 73 are drawn to a "program" perse as recited in the preamble and 
as such is non-statutory subject matter. See MPEP § 2106.IV.B.1.a. Data structures 
not claimed as embodied in computer readable media are descriptive material perse 
and are not statutory because they are not capable of causing functional change in the 
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computer. See, e.g., Warmerdam, 33 F.3d at 1361, 31 USPQ2d at 1760 (claim to a 

data structure perse held nonstatutory). Such claimed data structures do not define 

any structural and functional interrelationships between the data structure and other 

claimed aspects of the invention, which permit the data structure's functionality to be 

realized. In contrast, a claimed computer readable medium encoded with a data 

structure defines structural and functional interrelationships between the data structure 

and the computer software and hardware components which permit the data structure's 

functionality to be realized, and is thus statutory. Similarly, computer programs claimed 

as computer listings perse, i.e., the descriptions or expressions of the programs are not 

physical "things." They are neither computer components nor statutory processes, as 

they are not "acts" being performed. Such claimed computer programs do not define 

any structural and functional interrelationships between the computer program and 

I 

other claimed elements of a computer, which permit the computer program's 
functionality to be realized. 1 



Claim Rejections ■ 35 USC § 103 

§ 

6. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: •> 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 
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7. Claims 58, 59, 61 , 62, 65, 66, 68, 69 and 72 are rejected under 35 U.S.C. 1 03(a) 
as being unpatentable over Grayson et al. (U.S. Pat. 5,963,217) in view of Taylor (U.S. 



6,983,424). 

an information presentation 



Pat. 6,424,935) and in further view of Dutta (U.S. Pat! 

As per claims 58, 65 and 72, Grayson teaches; 
apparatus and method, comprising: 

voice outputting means for carrying out voice synthesis based on text information 
including in send data received from a sending apparatus for sending send data 
including text information, and outputting obtained synthetic voice (host computer sends 
a stream of text and other data to the participant computer and performs text-to-speech 
synthesis for output, col. 4, lines 14-16 and 42-43); and 

displaying means for displaying, to a display device, speaker images imitating 
speakers of said synthetic voice (text generates lip positions, col. 4, lines 16-20); and a 
text string to be spoken by said synthetic voice in a text display form corresponding to 
each of said speaker images (text section where the text display form is the same for all 
characters, col. 8, lines 52-53). 

Grayson does not teach the second display means displays the text string in a 
different text display form, which enables to distinguish each of said speaker images, 
corresponding to each of said speaker images. \ 

Taylor teaches a system that displays text in different colors or fonts for different 
speakers (col. 3, line 66 to col. 4, line 14). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify the system of Grayson to use a different display form for each 
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speaker as taught by Taylor because this would allow the text for each speaker to be 
readily distinguished (col. 3, line 66 to col. 4, line 14). 

Neither Grayson nor Taylor teach including a need for hiding of the text 
information with the transmitted text information and if there is the need for hiding of the 
text string, displays the speaker image by placing it over the text string to hide the text 
string from user's view. 

Dutta teaches a system for displaying images and text on a display screen that 
makes a determination that text does not need to be displayed and scales the icon to be 
displayed in the area, hence placing it over the text area (col. 9, lines 29-40 and Fig. 10, 
elements 1 022, 1 024 and 1 028). | 

It would have been obvious to one of ordinary |kill in the art to modify the system 
of Grayson and Taylor to determine if there is a need Tor hiding the text string and if so 
displays the speaker image by placing it over the text string to hide the text string from 
user's view because, as taught by Dutta, it is desirable to scale the icons to fit the 
available area of the video screen (col. 2, lines 1-5). 

8. As per claims 59 and 66, Grayson teaches wherein said first displaying means 
displays a speaker image selected from said plurality of speaker images (context 
defines characters to display, col. 4, lines 5-13). 

Grayson does not teach a first retaining means for retaining display 
correspondence information showing a correspondence between each of a plurality of 
speaker images and the display form of a text string and said second displaying means 
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obtains a display form corresponding to said selected 'speaker image from said display 
correspondence information to display said text string in said display form. 

Taylor teaches a first retaining means for retaining display correspondence 
information showing a correspondence between each of a plurality of speaker images 
and the display form of a text string and said second displaying means obtains a display 
form corresponding to said selected speaker image from said display correspondence 
information to display said text string in said display form (sets flags to identify which 
text corresponds to which user and then sets colors or fonts accordingly, hence 
retaining the correspondence, col. 9, lines 50-65). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify the system of Grayson for a first retaining means for retaining 
display correspondence information showing a correspondence between each of a 
plurality of speaker images and the display form of a text string and said second 
displaying means obtains a display form corresponding to said selected speaker image 
from said display correspondence information to display said text string in said display 
form as taught by Taylor because it would allow the view of the text change along with 
the animated character hence making the system more enjoyable to the user. 
9. As per claims 61 and 68, Grayson does not teach the text display form 
corresponding to each of said speaker images in said second displaying means is at 
least any of the letter color, size and font. 



* 
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Taylor teaches the text display form corresponding to each of said speaker 
images in said second displaying means is the color (color or fonts, col. 3, line 66 to col. 
4, line 14). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify the system of Grayson so that the text display form corresponding to 
each of said speaker images in said second displaying means is the color as taught by 
Taylor because it would allow interface to be more lively hence making interaction more 
enjoyable. 

10. As per claims 62 and 69, Grayson teaches the second displaying means displays 
said text string with a predetermined positional relation with the position in which said 
speaker image is displayed (avatar is viewed in a defined place on border and the text 
is viewed in the center of the interface, Fig. 7). 

11. Claims 60, 63, 67 and 70 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Grayson in view of Taylor and in further view of Dutta and Hikawa 
(U.S. Pat. 6,424,944), cited by applicant. 

As per claims 60 and 67, Grayson teaches 

second retaining means for retaining context correspondence information 
showing a correspondence between said plurality of speaker images and the context 
(context information would be saved on site in order to send it to the participant, col. 4, 
lines 5-7); 
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selecting means for identifying the context of text information included in said 
send data in said receiving apparatus (state also sends a stream of text with context 
information, col. 4, lines 14-16); and 

selecting a speaker image corresponding to the identified context based on said 
context correspondence information, wherein said first displaying means displays a 
speaker image selected by said selecting means (context information defines 
characters in presentation, col. 4, lines 9-13). ^ 

Grayson, Taylor and Dutta do not teach nor specifically point out performing 
these operations for genres. 

Hikawa teaches the user selecting a genre of background music to be 
reproduced and selecting an image based upon this genre, col. 9, lines 4-8 and 41-44). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify the system of Grayson, Taylor and Dutta to teach using genre 
information as taught by Hikawa because it would allow the user to select different 
categories within the current application hence facilitating navigation for the user. 
12. As per claims 63 and 70, Grayson, Taylor and Dutta do not teach the text 
information included in said send data includes headline text and text of spoken 
contents, and said information presentation system further comprises third displaying 

means for displaying said headline text distinguishable from the text display by said 

% 

second display means. I 

i 

Hikawa teaches the text information included in said send data includes headline 
text and text of spoken contents (title and whole contents are displayed, col. 9, lines 21- 
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24), and said information presentation system further comprises third displaying means 
for displaying said headline text distinguishable from the text display by said second 
display means (name display field, Fig. 5, element 31a). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify the system of Grayson, Taylor and Dutta to have the text information 
included in said send data include headline text and text of spoken contents, and said 
information presentation system further comprising third displaying means for displaying 
said headline text distinguishable from the text display by said second display means as 
taught by Hikawa because it would give the user an easily visible abbreviated 
information about the text being read hence facilitating use. 

13. Claims 64, 71 and 73 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Grayson et al. (U.S. Pat. 5,963,217) in view of Taylor (U.S. Pat. 6,424,935) and in 
further view of Lowry (U.S. Pat. 5,946,002). 

Grayson teaches an information presentation apparatus and method, comprising: 
voice outputting means for carrying out voice synthesis based on text information 
including in send data received from a sending apparatus for sending send data 
including text information, and outputting obtained synthetic voice (host computer sends 
a stream of text and other data to the participant computer and performs text-to-speech 
synthesis for output, col. 4, lines 14-16 and 42-43); and 

displaying means for displaying, to a display device, speaker images imitating 
speakers of said synthetic voice (text generates lip positions, col. 4, lines 16-20); and a 
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text string to be spoken by said synthetic voice in a text display form corresponding to 
each of said speaker images (text section where the text display form is the same for all 
characters, col. 8, lines 52-53). 

Grayson does not teach the second display means displays the text string in a 
different text display form, which enables to distinguish each of said speaker images, 
corresponding to each of said speaker images. 

Taylor teaches a system that displays text in different colors or fonts for different 
speakers (col. 3, line 66 to col. 4, line 14). \ 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify the system of Grayson to use a different display form for each 
speaker as taught by Taylor because this would allow the text for each speaker to be 
readily distinguished (col. 3, line 66 to col. 4, line 14). 

Neither Grayson nor Taylor teach wherein said display means, if the speaker 
image and the text string are overlapped due to a restriction of a display area of the 
display device, downsizes the speaker image to display it so as not to overlap the text 
string. 

Lowry teaches an image animation system that displays both textual information 
and an animation image on the screen simultaneously wherein if the image and the text 
string are overlapped due to a restriction of a display area of the display device, 
downsizes the speaker image to display it so as not to overlap the text string (resizes 
the animation image to avoid overlapping the text, col. 5, lines 3-10). 



Application/Control Number: 09/967,004 Page 1 1 

Art Unit: 2655 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify the system of Grayson and Taylor wherein said display means, if the 
speaker image and the text string are overlapped due to a restriction of a display area of 
the display device, downsizes the speaker image to display it so as not to overlap the 
text string because, as taught by Lowry, without this functionality the text may "overrun 
into the area of animation window, causing either the translated text to be superimposed 
on animation window or to be hidden underneath animation window either of which is 
aesthetically unpleasing" (col. 4, lines 57-61). 



Conclusion 

14. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. Armga et al. (U.S. Pat. 6,390,371) also teaches a method for 
controlling the sizes of animation and textual windows to avoid overlapping. Dutta (U.S. 
Pat. 6,453,294) teaches a system for online chat capabilities including speech-to-text 

and text-to-animation. f 

I 

Any inquiry concerning this communication or earlier communications from the 

.# 

examiner should be directed to Matthew J. Sked whose telephone number is (571) 272- 

i 

7627. The examiner can normally be reached on Moih-Fri (8:00 am - 4:30 pm). 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on (571) 272-7843. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). < 
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